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ABSTRACT 

In  this  paper,  a  new  technique,  the  multiple  ripple  propagation  technique,  is  presented 
for  mapping  an  h  Xw  grid  into  an  w  xh  grid  such  that  the  dilation  cost  is  2.  That  is.  such 
that  any  two  neighboring  nodes  in  the  first  grid  are  mapped  into  two  nodes  in  the  second 
grid  that  are  at  most  distance  2  apart.  This  technique  is  then  used  as  a  basic  tool  for  map¬ 
ping  any  rectangular  source  grid  into  a  square  target  grid  with  the  dilation  two  property 
preserved.  The  ratio  of  the  number  of  nodes  in  the  source  grid  to  the  number  of  nodes  in 
the  target  grid,  called  the  expansion  cost,  is  shown  to  be  always  less  than  1.2.  This  is  a 
noticeable  improvement  over  the  previously  suggested  techniques  in  which  the  expansion 
cost  could  be  bounded  by  1.2  only  if  the  dilation  cost  is  allowed  to  be  as  high  as  18. 


•)  This  work  is.  in  part,  supported  under  ONR  contract  N00014-80-C-045J5. 
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1.  Introduction 


In  this  research,  we  study  the  problem  of  squaring  up  a  rectangular  grid.  That  is. 
embedding  an  h  Xw  rectangular  grids  into  an  k  xk  square  grid,  where  k  ^|vhw  |,  and  [  |  is 
the  ceiling  function.  The  results  of  this  research  may  be  applied  to  the  VLSI  design  of 
highly  eccentric  circuits  that,  without  squaring  up.  would  have  to  be  laid  out  in  a  rectangu¬ 
lar  area  with  a  height/width  ratio  very  far  from  unity  [5.  8].  They  may  also  be  applied  to 
the  mapping  of  rectangular  problem  domains  (as  for  example  finite  element  grids  [3,6]) 
into  mesh  connected  architectures  [l,  7],  Mapping  of  rectangular  program  graphs  into 
hypercube  architectures  may  also  benefit  from  this  research.  Specifically,  it  has  been  shown 
[4]  that  this  mapping  may  be  accomplished  by  embedding  the  graph  into  a  square  graph 
which  is,  then,  mapped  easily  to  the  hypercube. 

Two  measures  may  be  used  to  estimate  the  quality  of  The  embedding.  The  first  meas¬ 
ure  is  the  expansion  cost.  E .  which  is  the  ratio  of  the  number  of  nodes  in  the  square  target 
grid  to  the  number  of  nodes  in  the  source  rectangular  grid.  That  is  E  —  k2/hw..  The  other 
measure  is  the  dilation  cost  D  .  which  is  a  measure  of  the  communication  penalty  that  has 
to  be  paid  due  to  the  squaring  up.  More  specifically,  if  a  link  X  in  the  source  grid  connects 
two  neighboring  nodes,  say  (i  ,j  )  and  (i  .y'+l).  and  these  two  nodes  are  mapped  to  the 
nodes  (i ' .j ')  and  (i  '+c,  ,j  ‘+Cj  )  in  the  target  grid,  then  the  dilation  of  the  edge  X  after  the 
embedding  is  defined  by  D(X)=lc,  l  +  lc;  I.  The  dilation  cost  of  the  embedding  is  then 
given  by  D  =max  D  (X). 

The  best  known  results  for  embedding  an  h  Xw  grid  into  the  smallest  possible  k  xk 
grid  are  given  in  [2],  where  different  embedding  methods  are  suggested  for  different  ranges 
of  the  eccentricity  ratio  p=w//i.  Assuming  that  25.  all  the  methods  suggested  in  [2] 
produce  embeddings  with  expansion  costs  smaller  than  1.2,  and  dilation  costs  ranging  from 
2  to  18,  depending  on  the  value  of  p.  Specifically,  the  dilation  cost  is  less  than  or  equal  to  3 
if  p  io  in  cne  of  the  ranges  (1.2],  (10/3.4],  (8,9],  or  (155, oo).  Otherwise,  the  dilation  cost  is 
larger  than  5. 

In  this  paper,  we  first  introduce,  in  Section  2.  the  multiple  ripple  propagation  tech¬ 
nique  which  may  be  u.ted  to  emb^d  an  h  Xw  grid  ir.tc  ar  w  vh  grid  with  expansion  cost  1 
and  dilation  cost  2.  This  basic  technique  is  then  used  in  Sections  3  and  4  to  embed  any 


I 

\m 
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rectangular  grid  with  p^4  into  a  square  grid.  The  idea  is  to  apply  the  ripple  propagation 
technique  to  carefully  chosen  subrectangles  of  the  rectangular  grid.  For  grids  with  p>4, 
the  ripple  propagation  technique  may  be  combined  with  the  well  known  technique  of  fold¬ 
ing.  This  is  described  and  analyzed  in  Section  5.  Finally,  in  Section  6.  we  summarize  our 
results  and  show  that,  it  is  always  possible  to  square  up  any  rectangular  grid  at  a  dilation 
cost  of  2  and  an  expansion  cost  less  than  1.2.  This  is  a  clear  improvement  over  the  results 
given  in  [2]. 

2.  A  multiple  ripple  propagation  technique 

The  purpose  of  the  technique  described  in  this  section  is  to  map  an  h  Xtv  grid,  which 
satisfy 

h  <  w  ^  2  h  (l) 

into  an  w  Xh  grid  with  unity  expansion  cost  and  with  dilation  cost  equal  to  2.  In  order  to 

accomplish  that,  the  w  nodes  in  each  row  in  the  original  grid  should  be  compressed  to 
occupy  only  h  columns.  For  this,  we  let  l  =  w—  h  .  and  compress  21  nodes  from  each  row 
into  l  columns  by  repeated  rippling.  The  remaining  s  =  w  —21  =  2 h  —  w  nodes  are  left 
uncompressed.  In  Figure  1(b).  we  show  the  grid  of  Fig  1(a)  after  compressing  each  of  its 
rows.  As  shown  in  the  figure,  the  positions  of  the  l  ripples  in  each  row  are  chosen  as  fol¬ 
lows:  In  the  first  row.  the  l  ripples  are  grouped  to  the  right,  and  in  the  last  row,  the  l  rip¬ 
ples  are  grouped  to  the  left.  At  each  row.  one  of  the  ripples  that  was  grouped  to  the  right 
in  the  previous  row,  starts  its  propagation  to  the  left  (moves  one  column).  The  propagation 
of  that  ripple  continues  at  a  rate  of  one  column  every  row  until  it  can  no  longer  propagate. 
The  propagation  of  the  ripples  is  very  similar  to  the  motion  of  the  legs  of  a  walking  worm. 

Figure  1(b)  is  laid  out  to  occupy  w  +r  rows  and  h  columns.  However,  it  may  be 
noticed  that  s  positions  in  each  column  is  not  utilized.  This  allows  for  the  compression  of 
Fig  1(b)  into  an  w  xh  grid  which  has  a  dilation  cost  equal  to  2  (see  Fig  1(c)).  In  order  to  be 
more  formal,  we  let  F(i  ,j  )=(u  (i  ,j  ).v  (i  ,j  ))  be  the  function  which  maps  each  point  (i  ,j  ) 
in  the  source  grid  to  a  corresponding  point  (u  (i  ,j  ).v  (i  ,j  ))  in  the  target  grid.  For  any  node 
(l.y )  in  the  first  row  of  the  source  grid,  the  mapping  function  F  is  defined  as  follows: 


y 
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Fig  1  -  Embedding  an  11X16  grid  into  an  16x11  grid 
using  multiple  ripple  propagation 


«(!.;)  = 


v  (l.y  )  = 


1 

for  j  =1 . j 

1+rem  J--~- 
2 

for  j  =  r  +1 . w 

j 

for  j  =1 . s 

for  j  =s  +1 . w 

Where  rem  ()  is  the  remainder  of  the  integer  division.  The  function  F  may  then  be  defined 
recursively  such  that,  for  any  node  (i  ,j )  not  in  the  first  row.  F(i  ,j  )  is  specified  in  terms 
of  F(i—l,j).  In  order  to  simplify  the  recursive  definition  of  F .  we  partition  the  source 
grid  into  four  regions  as  shown  in  Fig  2  and  Fig  3,  and  we  use  different  recursive  formulas 
for  different  regions.  Specifically. 


u(i  ,j)  = 


v  (t  )  — 


u  (i  —  l.j  )  +  1 
u  ( i  —  l.j  )  +  2 
u(i—  l.j)  +  A u  3(i  .j ) 
u(i-l.j)  +  A„  4(i  ,j  ) 

v(i-l.y)  ' 
v(i-l.y) 

v  (i -l.j  )  -  Av  3(i  ,j  ) 
v  (i  —1,J  )  —  Av  4(i  J) 


if  (i  ,j  )  €  Region  1 
if  (i ./  )  €  Region  2 
if  (i ,/  )  e  Region  3 
if  (i  .j  )  €  Region  4 


if  (i ._/ )  e  Region  1 
if  (i  .j  )  €  Region  2 
if  (i  .J)  €  Region  3 
if  (i  .j  )  €  Region  4 


Where.  Au  3  and  4 .  3  depend  on  the  remainder  r(i.j)  =  rem  ((  j  —s  +i  — 1)/3).  Specifically. 


iV  •'»  »*»  I 
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Given  the  above  formulas,  the  following  theorem  proves  that  the  dilation  cost  of  the 
mapping  F  is  at  most  two. 

Theorem:  For  any  (t  ,j ).  where  i  .j  >  1.  the  following  is  true 

\u  (i ,']  )  —  u  (i  —  l.j  )  I  +  I  v  (£ ,j )  —  v  (t  —  l.j  )  I  ^2  (4. a) 

and 

\u  (i  ,j  )  —  u  (i  .j  —  1)  I  +  I  v  (t ._/ )  —  v(t  .j  —  1)  I  ^2  (4.b) 

That  is.  any  two  adjacent  nodes  in  the  source  grid  are  mapped  into  two  nodes  that  are  not 
more  than  a  distance  two  apart  in  the  target  grid. 

Proof:  The  proof  of  (4.a)  is  straight  forward  if  (i  ,j  )  is  in  Region  1  or  Region  2.  If  (t .j )  is 

in  region  3,  then  the  left  side  of  (4.a)  reduces  to  I  A u  3(i  .j)  I  +  I  Av  3(i  ,j  )  I ,  which  is  equal 

to  1  if  r  (£  ,j  )=2,  and  to  2  otherwise.  The  case  (i  ,  j  )e  Region  4  is  similar. 

To  prove  (4.b)  we  use  induction  on  i.  For  £=1,  the  proof  is  by  direct  substitution 
from  equations  (2).  Next,  assuming  that  (4.b)  holds  for  i—  1,  we  should  show  that  it  also 
holds  for  i .  Again  the  induction  proof  is  straight  forward  if  (i  .j  )  is  in  Regions  1  or  2,  and 
is  similar  if  (£  ,j )  is  in  Regions  3  and  4.  For  this  reason  we  will  consider  in  the  rest  of  this 
proof  only  the  case  in  which  (i  ,j  )e  Region  3.  For  this  case,  we  will  prove,  by  induction,  a 
more  restrictive  form  of  (4.b),  namely 


u(£  ,j  — 1)  = 

v(£ ./')  -  v(i  ,j  — 1)  = 


(5.a) 

(5.b) 


—1  ifr(i./')=2 

1  if  r  (i  ,  j  )=0  or  1 

0  if  t-  (£  ,j  )=2 

1  if  r  (£  ,j  )=0  or  1 

For  i  =2.  equation  (5)  is  proved  directly  from  equations  (2)  and  (3).  To  prove  the 

induction  step,  we  notice  that  if  (t  .j )  is  in  Region  3,  then  (£ 1)  is  either  in  Region  3  or 

in  Region  1.  We  first  assume  that  (i  .j— 1)  is  in  Region  3  and  use  (3)  to  obtain 

u(i  ,j)  —  u  (i  ,/-l)  =  u(  i  -l.j)  —  u(i  —  l.j—  1)  +  Aa  3(i  ,j  )  -  Au  3(i l)  (6.a) 

v  (i  ,j  )  —  v(i  ,j—  1)  =  v(i -l.j  )  —  v(t  —  l.j—  1)  -  Av  _3(i  .j  )  +  A  v  3(i  ,j—l)  (6.b) 

If  r  (£,/)= 2,  then  Aa  3(i  .j  )=0  and  Au3(i.y— 1)=2  because  r(i,j— 1)=1.  Also, 

r  (i  —  l./)=l.  which,  from  the  induction  hypothesis,  gives  u  (i  —  l.j  )— u  (i  —  l.j  —  l)=l. 

Therefore,  from  (6.a)  we  have  u  (t  .j  )— u  (i  ,j  —  l)=l+0— 2=— 1.  Similarly.  Av  3(i.y')=l. 
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v  (i  .j  )— v  (i  ,j  — l)=l— 1+0=0.  The  same  type  of  argument  applies  if  r  (i  ,j  )=0  or  1. 

Finally,  if  (i J  —  1)  is  in  Region  1.  then  j  =s  —i  +2  and  thus,  r  ( i  ,j  )=1.  From  (3. a),  we 
get  u  (:'  ,J  )— u  (i  ,j  —  1)  «  u  (i  —  l.j  )— u  (i  —  l.j — 1)+2— 1  and  from  (3,b)  we  get 

v  (i  ,j  )— v(i —  1)  =  v(i—  1J  )— v  (i  —  l.j—  1).  But  both  U  —  l,y)  and  (i—  1J—  l)  are  in 
Region  1,  and  thus  u  (i  —  l.j  )=u  (i  —  l.j  —  1).  and  v  (i  —  l.y  )=v  (t  —  l.j  — 1)+1,  which  proves 
(5.a)  and  (5.b).  respectively .[] 

The  above  theorem  proves  that  it  is  possible  to  map  an  h  Xw  grid,  h  <w  $2/i .  exactly 
into  an  w  Xh  grid  with  dilation  cost  2.  It  is  also  possible  to  concatenate  the  w  xh  target 
grid  with  its  symmetric  image  (reflected  across  the  line  v  =h  )  to  obtain  an  exact  embedding 
of  an  h  X2w  grid  into  an  w  X2h  grid  with  dilation  cost  2.  Along  the  same  line  of  thinking, 
an  h  x2w  +1  source  grid  may  be  divided  into  an  h  Xw  +1  and  an  h  Xw  subgrids.  These  two 
subgrids  may  then  be  embedded  into  an  w+lX/i  and  an  w  xh  grids,  respectively,  and  by 
concatenating  the  former  with  the  symmetric  image  of  the  latter,  we  may  obtain  an 
w  +lx2h  target  grid.  The  dilation  cost  at  the  line  of  concatenation  may  be  shown  to  be  at 
most  two.  Grid  concatenations  of  the  type  described  here  will  be  used  repeatedly  and 
tacitly  in  the  rest  of  this  paper. 

In  the  following  sections,  we  apply  the  above  technique  to  our  original  problem  of 
mapping  an  h  Xph  rectangular  grid  (p  is  assumed  to  be  greater  than  unity),  into  a  square 
grid.  First,  two  basic  methods  are  introduced  for  grids  with  p^4.  These  methods  are  then 
combined  with  folding  and  applied  effectively  to  the  embedding  of  any  grid  with  p>4. 

3.  The  method  of  exact  row  fitting. 

Let  k  ,  k  >h  be  the  dimension  of  the  square  grid  (called  the  target  grid)  onto  which 
the  given  h  xph  grid  (called  the  source  grid)  is  to  be  mapped.  Of  course,  it  is  desirable  to 
chose  the  smallest  possible  k  in  order  to  minimize  the  expansion  cost  E  =  k2/ph2.  Given 
such  a  k  ,  the  method  of  exact  row  fitting  assumes  that  the  right  most  h  Xk  subgrid  of  the 
source  grid  may  be  mapped  exactly  into  the  k  Xh  right  most  subgrid  of  the  target  grid  (see 
Fig  4).  This  is  possible  if  and  only  if  condition  (l)  is  satisfied.  That  is 

h  ^  k  <  2h  (7.a) 


Moreover,  if 


->  4- 

k-h 


4 - h  - J> 


(a)  (b) 

Fig  4  -  Embedding  an  7X11  grid  into  an  9x9  grid 
using  the  method  of  exact  row  fitting 

k  —  h  ^  ph  —  k  (7.b) 

that  is,  the  number  of  columns,  k  —h  .  remaining  in  the  target  grid  is  at  least  equal  to  the 

number  of  columns,  ph  —k  ,  remaining  in  the  source  grid,  then  these  columns  may  be 
mapped  in  a  trivial  way  with  dilation  cost  2.  In  other  words,  the  mapping  may  be  com¬ 
pleted  with  dilation  cost  2  provided  that  the  size  of  the  target  grid,  k  .  satisfies  the  condi¬ 
tions  (7,a/b). 

The  solution  of  the  inequalities  (7.a/b)  may  be  found  by,  first,  computing  the 
minimum  k  that  satisfies  (7.b),  and  then  checking  that  this  value  is  consistent  with  (7.a). 
Specifically.  (7.b)  is  satisfied  if 


k  =  Kr  =  [  h  |  (8) 

It  is  straight  forward  to  check  that  the  value  of  k  given  by  (8)  satisfies  (7. a)  if 
p<3  —  1  /h  .  Hence,  the  method  of  exact  row  fitting  may  be  applied  only  if  p^3  —  l/h . 
Noting  that  Kr  ^  ((p+l)h  +l)/2.  we  may  obtain  an  upper  bound  on  the  expansion  cost  of 
the  resulting  embedding.  Namely. 


Er  =^-<Er, 
P 


(P+1+-J-)2 
_ n 

4p 


(9) 


The  value  of  Er  increases  monotonically  with  p  for  p>  1  +  1  /h  ,  and  hence  may  exceed 
1.2  for  large  values  of  p.  For  example,  assuming  h  =12,  then  Er  >1.2  if  p> 2.06.  Moreover. 


if  p>3— \/h  ,  the  method  may  not  be  applied.  In  these  cases,  the  method  of  exact  column 
fitting,  described  in  the  following  section,  can  be  used. 

4.  The  method  of  exact  column  fitting 

The  embedding  technique  used  in  this  section  is  based  on  the  vertical  disection  of  both 
the  source  and  the  target  grids,  each  into  two  subgrids  which  are  as  equal  as  possible.  Each 
of  the  source  subgrids  is  then  embedded  into  the  corresponding  target  subgrid  in  a  way  that 
ensures  that  all  the  columns  of  the  target  grid  are  efficiently  used.  In  order  to  deal  with  the 
case  of  ph  being  an  odd  integer,  the  number  of  columns  in  the  two  source  subgrids  is  taken 
to  be  \ph  /2|  and  \ph /2\.  respectively,  where  |j  is  the  floor  function.  For  the  same  reason, 
the  number  of  columns  in  the  target  subgrid  is  divided  into  \k  /2|  and  |/fc  /2]  columns,  respec¬ 
tively  (see  Fig  5). 

The  optimal  si2e.  k  =KC .  of  the  target  grid  should  be  determined  by  the  embedding  of 
h  x| ph  /2|  -*  k  x[£  /2]  or  the  embedding  h  x|p/i  /2j  -*  k  x[&  /2).  whichever  gives  a  more  strict 
condition  on  k  .  It  turns  out  that  the  latter  embedding  is  more  restrictive  than  the  former, 
and  hence,  should  be  used  to  derive  k  .  In  the  remaining  of  this  section,  we  will  denote  the 
h  x[p/i  /2|  grid  by  G, .  and  the  k  x[£  /2|  grid  by  G, ,  and  we  will  describe  an  embedding  of 
Gj  into  G, .  The  embedding  of  the  other  half  of  the  source  grid  (the  h  x\ph  /2|  subgrid)  into 
the  other  half  of  the  target  grid  (the  k  x[fc  /2]  subgrid)  may  be  accomplished  in  a  similar 
fashion. 
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Fig  5  -  Embedding  an  7x15  grid  into  an  11X11  grid 
using  exact  column  fitting 


Consider  the  upper  [&  /2]  X  \ph  /2)  subgrid  of  Gs ,  and  embed  it  into  the  upper 
\ph  /2j  X  [£  /2)  subgrid  of  G,  using  the  rippling  propagation  technique  of  Section  2.  In  order 
to  accomplish  this  embedding  we  should  have 

[k  /2]  ^  h  (10. a) 

and  condition  (l)  should  be  satisfied,  namely 

\k  /2j  <  \ph  /2j  <  2  \k  /2\  (lO.b) 

With  this,  each  of  the  remaining  h  —  [&  /2j  rows  in  Gs  may  then  be  compressed  to 
have  the  same  pattern  as  the  last  row  of  the  | ph  /2j  X  [£  /2j  -*  |&  /2j  X  | ph  /2|  embedding. 
This  results  in  a  dilation  cost  equal  to  2  and  requires  2{h  —  [£  /2|)  additional  rows  in  G, . 
Thus,  the  following  should  be  satisfied 


k  -\ph/2\  ^  2  {h  ~[k/2\)  (10 .c) 

N’oting  that  (x  +  l)/2^|x  /2j^(x  — 1)/2.  we  may  calculate  the  minimum  value  of  k  which 
always  satisfies  (lO.c).  Namely. 

k  =  Kc  =\Ph/2  +  2h  +  1_|  (11) 

This  value  of  Kc  satisfies  the  conditions  (lO.a)  and  (lO.b)  as  long  as  p< 4. 

With  the  value  of  k  given  by  (11).  the  two  halves  of  the  source  grid  may  be  success¬ 
fully  embedded  into  the  two  halves  of  the  target  grid  with  dilation  cost  2.  N’oting  that 
|;t  /4]^(x  +3)/4  .  it  is  possible  to  bound  the  expansion  cost  of  the  embedding  as  follows: 


Ec 


$  E, . 


(p+4+|-)2 

_ n 

I6p 


expansion  cost 


Fig  6  -  Expansion  cost  Vs  eccentricity  for  1  <p^4  (h  =20) 


(12) 


For  l<p^4,  the  value  of  Ee  is  monotonically  decreasing  with  p,  which  suggests  the 
use  of  the  method  of  fitting  columns  whenever  the  method  of  fitting  rows  fails  to  satisfy 
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Er  ^1.2  (see  Fig  6).  The  critical  value  of  p  that  determines  which  of  the  two  methods  has  a 
smaller  expansion  cost  may  be  found  by  solving  Er  =EC  .  From  (9)  and  (12),  this 
gives  p  =  2+3 /h  .  The  expansion  cost  at  this  value  of  p  is  (3 h  +4)2/(4/i  (2 h  +3)),  which  is 
always  smaller  than  1.2  if  h  >  17, 

Hence,  for  l<p^4,  the  most  efficient  embedding  method  depends  on  the  value  of  p. 
Specifically,  if  p^2+3/7i ,  then  the  methods  of  exact  row  fitting  should  be  used,  otherwise, 
the  method  of  exact  column  fitting  should  be  used.  For  values  of  p  larger  than  four,  the 
above  methods  can  be  combined  with  the  known  method  of  folding  [2]  as  described  in  the 
next  section. 


5.  Combining  ripple  propagation  with  folding. 

If  p  =  (q  +  l)2  for  some  integer  q  ^  1.  then  the  source  grid  may  be  folded  q  +1  times  to 
fit  exactly  an  (q  +  l)h  X  ( q  +  l)/i  target  grid.  In  fact,  it  is  easy  to  show  that  if 


"^1^2  ~  <  ?  ^  (13.a) 

for  some  integer  q  >1.  then  folding  the  source  grid  into  an  (q  +l)h  X  (q  +l)h  target  grid 

will  result  in  an  expansion  cost  less  than  1.2.  In  Fig  7,  we  illustrate  the  technique  of  fold¬ 
ing  by  an  example.  As  clear  from  this  figure,  successive  tracks  (a  track  consists  of  h  con¬ 
secutive  rows  of  the  target  grid)  are  joined  by  two  h  Xh  corner  tiles  that  guarantee  a  dila¬ 
tion  cost  equal  to  two. 


As  described  above,  folding  may  result  in  few  unused  columns  in  the  last  track  of  the 


target  grid,  and  condition  (13. a)  limits  the  number  of  these  unused  columns.  It  is  also  pos¬ 
sible  to  apply  folding  and  leave  some  rows  of  the  target  grid  unused.  More  precisely,  if  the 
eccentricity  of  the  source  grid  satisfies 

q2  ^  p  ^  1.2^2  (13. b) 

then,  it  is  possible  to  fold  this  grid  into  an  ph /q  X  ph  !q  target  grid.  This  will  leave 
(rho  —q2)h  /q  unused  rows  in  the  target  grid,  and  condition  (13. b)  will  guarantee  that  the 
number  of  unused  rows  does  not  exceed  O.lqh  .  Thus,  the  expansion  cost  will  be  less  than 
1.2. 

5.1.  Combining  folding  with  exact  row  fitting 

Consider  an  h  Xph  source  grid  which  satisfies  1.2 q2  <  p<  (.g  +1)2/1.2.  Clearly,  fold¬ 
ing  this  grid  into  a  square  grid  is  too  expensive  (expansion  cost  larger  than  1.2)  because  nei¬ 
ther  (13.a)  nor  (13. b)  is  satisfied.  In  this  section,  we  introduce  a  method  which  combines 
folding  and  exact  row  fitting.  This  method  will  be  denoted  by  FR.  In  order  to  describe  the 
FR  method  we  assume  that  the  source  grid  is  to  be  embedded  into  a  target  grid  of  size  k  , 
where  k  satisfies  qh  < k  ^(?+l)/i.  The  embedding  starts  by  folding  the  source  grid  into 
the  target  grid  q  times  as  shown  in  Fig  8(a).  Clearly,  the  right  most  h  X  ( ph  —qk  )  subgrid 
of  the  source  grid  will  not  fit  into  the  target  grid,  and  the  last  k  —qh  rows  of  the  target  grid 
will  be  unused.  The  idea  is  to  consider  the  last  track  resulting  from  the  folding  (an 
h  x  ph  —(q  —l)k  grid  denoted  by  Gs ).  and  to  squeeze  it  into  an  k—(q—\)h  Xk  grid 
(denoted  by  G,  )  that  fits  the  target  grid. 


k 


(»)  (b) 

Fig  8  -  Combining  folding  with  exact  row  fitting  ( q  =4.  and  ^=4) 


The  squeeze  is  performed  by  partitioning  Gs  vertically  into  P  subgrids  G,  0.  •  •  •  . 
Gj  and  partitioning  G ,  vertically  into  P  subgrids.  G,  0.  •  ■  •  .  Gt  j>-\,  and  then  mapping 
each  Gs  ,  into  the  corresponding  G,  , .  The  partitioning  of  G*  is  such  that  G,  >0.  is  an  h  xh 
grid  and  each  remaining  G,  , .  i  =1,....P—1  is  an  h  X  [(ph  —  (q  —  l)fc  —  h  )/(/*— 1)|  grid.  Note 
that  if  ph—(.q—l)k—h  does  not  divide  P—1  then  Gsj>-i  will  have  few  empty  columns. 
Similarly,  the  partition  of  G,  is  such  that  G,  0  is  an  k —(q —l)h  X  h  grid  and  each  of  the 
remaining  G,  * .  i  =1 . P  —1  is  an  k  —(q  —l)h  X  |(&  —h  )/{P— 1)]  grid. 

The  method  of  exact  row  fitting  introduced  in  Section  3  is  used  to  map  each  Gs  , . 

i  =1 . 9—1.  into  the  corresponding  G,  , .  As  for  the  mapping  Gf  0  G,  0.  it  should  ensure 

that  the  transition  from  track  q  —1  to  track  q  does  not  increase  the  dilation  cost  beyond 
two.  This  may  be  accomplished  by  expanding  the  h  xh  corner  tiles  (see  Fig  7)  into  an 
k  —(q  —l)h  xh  pattern  that  fits  G,  0  such  that  the  distribution  of  the  h  nodes  in  the  last 
column  of  G,  0  is  similar  to  the  distribution  of  the  h  nodes  in  the  first  column  of  G,  j.  It 
may  be  shown  that  such  an  expansion  is  always  possible  with  dilation  cost  not  exceeding 
two. 


<a>  (6) 

Fig  9  -  Mapping  Gs  ,  into  G,  ,  in  the  FR  method 


In  order  to  compute  the  optimum  size  k  of  the  target  grid,  we  follow  the  same  reason¬ 
ing  as  in  section  3.  Specifically,  the  method  of  exact  row  fitting  may  be  used  for  mapping 
any  Gs  ;  into  the  corresponding  Gt  i.  For  this,  the  following  conditions  should  be  satisfied 
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h  <S  k  -(q-l)h  «$  2 h 

\kjdLl_h  >  |  (P-D^i  -(g-l)^i  _  (Ir  _(n  _ 


(k-(q-l)h  ) 


(14. b) 
(I4.c) 


In  order  to  solve  the  above  system  of  inequalities,  we  first  find  the  minimum  value  of 
k  which  always  satisfies  (I4.c).  This  value  is: 

*  =A>  =  I  <£±ii£zlMii<£z21 1  (15) 

By  substituting  (15)  in  (l4.a)  and  (I4.b),  we  conclude  that  these  two  conditions  are 
satisfied,  respectively  if 


h 


p  ^  q  (q  + 1)  +  P  —1  — 


3 P  +o  -6 


(l6.a) 


(l6.b) 


Hence,  given  any  rectangular  grid,  the  method  may  be  used  if  there  exists  a  P  that 
satisfies  (16).  The  number  of  partitions  P  also  affects  the  expansion  cost.  More  precisely, 
from  (15).  we  find  that  K p  <((ph  +q  (P— l)h  +3 P  +q  — 6)/(q  +/*— l),  which  may  be  used  to 
bound  the  expansion  cost  by 

_  (p+q(P-l)  +  (3P+q-6)/hy 

E"  "  W+q-lvl -  (17> 

The  derivative  /nax  /ftP  is  negative  for  p^q2  which  means  that,  from  the  point  of  view 
of  minimizing  Ep .  it  is  advantageous  to  find  the  maximum  P  which  satisfies  (16).  For 
P  >q  ,  the  two  conditions  (I6.a)  and  (l6.b)  may  not  be  satisfied  simultaneously.  If.  how¬ 
ever.  p  is  in  the  range 

q2  +  (g  ~2)^2g  ~3)  **  p  (q+l)2-2-  ,  (18) 

h  n 

then.  (16.a/b)  are  satisfied  for  P —q  ,  and  hence  the  embedding  may  be  completed  with  q 
partitions  in  a  target  grid  whose  size  is  given  by  (15).  The  maximum  expansion  cost  may 
then  be  found  by  substituting  P  =q  in  (17)  to  obtain 


Efr  ^  Efr^x 


_  ( p+q  ( q  —1)  +  (4 q  — 6)//i  )2 


7  (2  q-l)2p 

In  Section  6.  it  will  be  shown  that,  for  q  ^3.  Ep  is  smaller  than  1.2  for  any  p  in  the 
range  specified  by  (18).  and  that,  outside  that  range,  p  satisfies  (13.a)  or  (13. b).  which 
means  that  folding  may  be  used  with  expansion  cost  less  than  1.2.  The  case  q  =2.  however. 


is  slightly  more  complicated.  For  instance,  if  h  &20,  then,  folding  is  too  expensive  in  the 
range  5.85  ^p^  7.5  (expansion  cost  is  larger  than  1.2).  Also  in  thst  range,  the  FR  method 
either  does  not  apply  (if  p>6.9)  or  gives  Ep.  >  1.2  (if  5.85<p<6.9).  In  this  case,  combining 
folding  with  the  method  of  exact  column  fitting  (the  FC  method)  turns  out  to  be  useful. 
Although  we  only  need  this  combination  for  q  =2.  the  FC  method  will  be  described  in  the 
next  section  for  general  q  .  The  reason  for  doing  so  is  that  for  q  ^3.  although  both  the  FC 
and  the  FR  methods  realize  an  expansion  cost  less  than  1.2.  it  will  be  shown  that  the  FC 
method  gives  better  results  than  the  FR  method  for  some  subranges  of  p. 


5.2.  Combining  folding  with  exact  column  fitting 

In  this  method,  denoted  from  now  on  by  FC.  the  source  grid  is  folded  into  the  target 
grid  as  described  in  the  previous  section,  and  also  each  of  Gs  and  G,  is  partitioned  into  P 
subgrid.  The  FC  method  is  different  from  the  FR  method  in  that  each  subgrid  Gt  i. 

i  =1 . P— 1.  is  mapped  into  the  corresponding  Gt  ,  using  the  method  of  exact  column  fitting 

rather  than  exact  row  fitting. 

The  conditions  that  have  to  be  satisfied  in  order  to  map  Gr  ,  into  G,  ,  using  exact 
column  fitting  are  analogous  to  the  conditions  (lO.a/b/c)  of  Section  4.  Specifically,  these 
conditions  are  (refer  to  Fig  10): 


I  k~h  |  ^  i(p—  l)/i—  (a—  l)fc,  <  ~\k—h\ 


*  -(9  -m  -  *2  (h  -  [^j 


(20.a) 


(20. c) 


The  same  technique  that  was  used  in  the  last  sections  is  applied  to  the  solution  of  the 
above  inequalities.  From  (20.c),  the  minimum  size  of  the  target  grid  is  found  to  be 

k  =  Kfc  =  \{p+* +  3(/>— 2)  |  (21) 

By  substituting  (21)  in  (20. a)  we  obtain  the  condition 

p  <  P2~P+q  +  (P  +?  —3)(P  —2)/h 

which  may  be  satisfied  only  if  P>q.  Also,  by  using  (21)  to  compute  the  expansion  cost 
EfC  .  and  then  differentiating  the  resulting  formula,  we  find  that  QEp/QP  is  positive  for 
p^P2—P+q  .  This  means  that  using  P—q  +1  partitions  will  give  the  best  expansion  cost. 
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Fig  10  -  Mapping  Gs  j  into  G,  ,  in  the  FC  method 
Now  using  P-q  +1  in  (21),  and  substituting  the  result  in  (20.a)  and  (20.b).  we  find  that 


these  conditions  are  satisfied  if  p  lies  in  the  following  range 


q2+ 1  +  ^~1}-  <  p  ^  (q  +  l)2-l  - 


That  is,  the  FC  method  may  be  applied  if  p  satisfies  (22).  The  expansion  cost  may  then  be 
computed  from  (21)  with  P  =q  +1.  The  upper  bound  on  this  cost  is  given  by 


Efc  ^  Efc  snax 


_  (p+9  (9  +  1)+1  +  ( 5q  —3  )/h  )2 


(2g  +  l)2  p 


6.  Discussion  and  conclusion 


Given  an  hxph  source  grid,  let  q  be  the  integer  tftat  satisfies  g2^p<(g+l)2.  For 


9=1.  it  has  been  shown  in  Sections  3  and  4  that  the  mapping  of  the  source  grid  into  a 


square  rectangular  grid  may  be  accomplished  by  using  the  method  of  exact  row  fitting  if 
p^2+3/h  ,  or  the  method  of  exact  column  fitting  if  p>2+3/h  .  In  both  cases,  the  expansion 


cost  is  proven  to  be  less  than  1.2. 


For,  9^2.  the  FR  or  the  FC  methods  described  in  Section  5  may  be  applied  provided 
that  Pi  <  p<p2,  where  the  critical  values  pj  and  p2  are  specified  from  (18)  and  (22). 


lNWW' 


Namely, 


Pi  =  ?2  + 


p2  =  ( q  +  l)2  —  1  - 


(24.a) 
(24. b) 


In  order  to  determine  which  of  the  two  methods  gives  a  smaller  expansion  cost,  we  notice 


from  (19)  and  (23)  that,  for  p>0,  Efrwax  and  Efc  /mx  intersect  at  only  one  point,  namely 


P3  =  g  (?  +D  ~  y  +  ^  ~  (24.c) 

We  also  observe  that  both  E p  and  £/c  /nax  are  of  the  form  /  (p)=(p+a  )2/(6  p).  for  some 

constants  a  and  b  .  Given  that,  for  p> 0.  the  function  /  has  only  one  local  minima  at  p-a  , 
we  may  determine  that  Efrwa*  has  its  local  minima  at  Pp  =q  (q —l)+(4q —6)/h  .  and 
Efc  wx  has  its  minima  at  pfc  =q  (?  +l)+l+(5g  —  3)/h  .  Clearly,  pp  is  smaller  than  p3  for 
h  >4,  and  pfc  lies  between  px  and  p2-  This  leads  to  the  conclusion  that  Efc/mx  <Ep  /mx  if 
p>p3  and  Efc/nax  >Efr/riax  if  p<p3.  In  Figure  11.  both  Efc/rulx  and  Ep  ^  are  plotted  for 
q  =2.  h  =20.  and  for  q  =4.  h  =10. 


Hence,  if  p  lies  between  p3  and  p3,  the  FR  method  is  recommended,  and  if  p  lies 
between  p3  and  p2.  then  the  FC  method  is  recommended.  If  this  strategy  is  applied,  then  the 
largest  expansion  cost  occurs  at  either  p=P2  or  p=p3.  By  direct  substitution  of  (24. b)  and 
(24.c)  into  (23),  and  after  simple  algebraic  manipulation,  it  may  be  shown  that,  for  h  ^18. 
the  value  of  Efc/nax  is  less  than  1.2  at  p2  and  p3. 


Neither  the  FR  method  nor  the  FC  method  may  be  applied  if  p  is  less  than  p3  or  larger 
than  p2.  However,  in  these  two  cases,  the  expansion  cost  resulting  from  simple  folding  is 
low  because  p  is  close  enough  to  q2  and  ($+l)2,  respectively.  In  fact,  if  p^Pj.  then 
p<q2(\+2/h  ),  which  satisfies  (13.b)  if  h  >  10.  Also,  if  p^p2,  then  p^(q  +l)2(l— 1  /qh  )— 1, 
which  satisfies  (13. a)  if  h  >10.  In  other  words,  the  application  of  simple  folding  in  these 
two  regions  will  result  in  an  expansion  cost  less  than  1.2. 

In  brief,  new  techniques  have  been  presented  and  analyzed  in  this  paper,  for  embed¬ 
ding  an  h  xph  rectangular  grid  into  a  square  grid  with  dilation  cost  equal  to  two.  The  most 
appropriate  technique  for  a  given  grid  have  been  shown  to  depend  on  the  size  of  that  grid, 
that  is  on  h  and  p.  By  adhering  to  the  selection  strategy  suggested  in  the  paper,  the  expan- 


16  P2  Ps  P2  25 

(b)  q  =4,  h  =10 

Fig  11  -  Expansion  cost  for  q2^p<(q  +1)2 


eccentricity 


sion  cost  is  guaranteed  to  be  smaller  than  1.2  if  h  is  larger  than  or  equal  to  18. 
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